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ABSTRACT 

Recent studies emphasize that an empirical relation between the stellar mass of galaxies and 
the mass of their host dark matter subhaloes can predict the clustering of galaxies and its 
evolution with cosmic time. In this paper we study the various assumptions made by this 
methodology using a semi-analytical model (SAM). To this end, we randomly swap between 
the locations of model galaxies within a narrow range of subhalo mass (Mi n f a it). We find that 
shuffled samples of galaxies have different auto-correlation functions in comparison with the 
original model galaxies. This difference is significant even if central and satellite galaxies are 
allowed to follow a different relation between Mmfaii and stellar mass, and can reach a factor 
of ~ 2 for massive galaxies at redshift zero. We analyze three features within SAMs that 
contribute to this effect: a) The relation between stellar mass and subhalo mass evolves with 
redshift for central galaxies, affecting satellite galaxies at the time of infall. b) In addition, the 
stellar mass of galaxies falling into groups and clusters at high redshift is different from the 
mass of central galaxies at the same time, c) The stellar mass growth for satellite galaxies after 
infall can be significant and depends on the infall redshift and the group mass. All of the above 
ingredients modify the stellar mass of satellite galaxies in a way that is more complicated 
than a dependence on the subhalo mass only. By using two different SAMs, we show that 
the above is true for differing models of galaxy evolution, and that the effect is sensitive to 
the treatment of dynamical friction and stripping of gas in satellite galaxies. We find that by 
using the FOF group mass at redshift zero in addition to Mj n f a n, an empirical model is able to 
accurately reproduce the clustering properties of galaxies. On the other hand, using the infall 
redshift as a second parameter does not yield as good results because it is less correlated with 
stellar mass. Our analysis indicates that environmental processes that affect galaxy evolution 
are important for properly modeling the clustering and abundance of galaxies. 



Key words: galaxies: formation 



1 INTRODUCTION 

The current paradigm for the formation of structure in the Universe 
predicts that galaxies form and evolve inside of dark matter haloes. 
This working assumption makes the relation between the properties 
of galaxies and their host haloes a fundamental probe for various as- 
pects of galaxy formation and cosmology. Galaxy properties such 
as stellar mass, star formation rate, color, and clustering are all inti- 
mately linked to the host halo mass. Consequently, there have been 
many recent attempts to quantify the relation between the mass of 
galaxies and their host halo mass (hereafter the 'mass relation'). 

Observational co nstraints on the mass rela tion were derived 
by wea k lensing (e.g. Mandelbaum e t alj|2006h . satellite dynam- 
ics (e.g. IConrov et alj|2007l) and large group catalogs of galaxies 
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dYang et alj|2009h . All these are able to provide constraints on the 
stellar-halo mass relation for 10 12 — 1O 14 M0 haloes. At lower 
halo masses there are currently no useful direct observational con- 
straints. Nonetheless, the abundance of low mass halo es is impor- 
tant for constraining the nature of dark matter (see, e.g. lBode et al.l 
l200ll ; lMacci6 & Fontanotll2010h . 

From the theoretical perspective, there are two main ap- 
proaches for studying the relationship between stellar and halo 
masses. A straightforward way is to use existing models and extract 
the mass relation as a secondary result. This a pproach was demon - 
strated using hydrodynamic al simulations (e.g . lSawala et al.l2010l) . 
and semi-analytical models jWang et al.l2006l : lMoster et alfcOldh . 
These two methodologies result in very different mass relations: 
hydrodynamical simulations predict higher stellar mass for a given 
halo mass, especially for low mass haloes. It is not clear whether 
the origin of this discrepancy is the different set of assumptions 
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adopted by each model, the different observational data which are 
used to constrain the models, or some specific physical ingredients. 

A different approach is to use the stellar-halo mass relation as 
the main assumption, and to verify it against a variety of observa- 
tions. This approach is more useful for a systematic analysis of the 
mass relation, and for carefully testing the necessary ingredients 
needed to explain it. The halo occupation distribution (HOD) was 
historically the first approach to follow this l ine of thought (see 
iBerlind & Weinberg! |2002| ; iTinker et alj|2005l : IZehavi et al JI2005L 
and references therein). It assigns per each halo a specific number 
of galaxies of a given type. By tuning the parameters of the number 
and location distributions, these models are able to reproduce the 
abundance and clustering properties of galaxies. Thus, the relation 
between haloes and galaxies can be constrained using a relatively 
simple set of assumptions, and the halo mass is a sufficient param- 
eter for fixing the properties of its galaxies. 

Various studies have shown that the HOD approach can 
be simplified even more using informati on on the substruc 
ture within haloes, i.e., the subhalo mass jKravtsov et al 
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halo can host only one galaxy. The galaxy mass is linked to the 
subhalo mass at the last time the subhalo was central within its 
FOF group (hereafter Minfaii). The specific relation between stellar 
mass and Mi n faii is then fixed by matching the abundance of galax- 
ies and Minfaii. This method is therefore termed 'abundance match- 
ing' (ABM; models based on the ABM approach will be termed 
ABMs in what follows). Using subhalo samples from large cos- 
mological iV-body simulations, ABMs are able to reproduce the 
auto-correlation function of galaxies surprisingly well. The main 
difference between the HOD and the ABM approach is thus that 
HOD uses the FOF mass for determining stellar masses, while the 
ABM approach uses the subhalo mass, which is limited to contri- 
butions from smaller scales of dark matter. 

By fixing the stellar mass according to Mi n f a n only, ABMs 
assume various simplifications to the statistics of galaxy formation. 
First, they neglect the redshift where the galaxy became a satellite. 
This redshift should affect the stellar mass because the mass rela- 
tion evolves with redshift. Second, the stellar mass gained by satel- 
lite galaxies after infall is assumed to be independent of clustering. 
Third, ABMs do not a llow a galaxy's lar ge-scale environment to 
modify its stellar mass dCroton et alj2007T) . However, it seems that 
these simplifications do not force the models to violate the cluster- 
ing properties of galaxies. It is not clear if this is due to the low 
sensitivity of the auto-correlation function to these assumptions, or 
because multiple effects conspire to cancel each other and leave the 
auto-correlation function unchanged. 

In this paper we will test the ABM assumptions using detailed 
semi-analytical models (hereafter SAMs). We will test whether the 
stellar mass function and clustering of galaxies in the SAMs de- 
pend only on Mi n f a n. It will be shown that SAMs do not follow the 
above assumptions made by ABM models. The inclusion of more 
detailed physics, and the fact that SAMs follow the evolution of 
each individual galaxy, affects the auto correlation functions of the 
model galaxies. The success of ABMs thus poses important ques- 
tions on the processes which govern galaxy formation. Is it possible 
that the simple ABM approach mimic the observed clustering bet- 
ter than the more sophisticated models? Or is their success purely 
a coincidence? 

In order to compare ABMs to SAMs we will use the technique 
of shuffling betwee n the location of m odel galaxies. This method 
was introduced by ICroton et al .1 ( 120071) . as a tool to quantify the 



effect of large-scale environment on the stellar mass of galaxies. 
These authors found that environment of the host FOF halo can 
contribute up to ~ 10 per cent variation in the correlation func- 
tion of galaxies. Here we adopt a similar technique in order to test 
the role of Mi n f a ii in shaping the stellar mass of galaxies and their 
correlation function. The eff ect we find is an or der of magnitude 
larger than the one found bv lCroton etal] j2007h at least for high 
mass galaxies. This is because Croton et al. (2007) test the HOD 
model, and not ABM, which means that they shuffle galaxies be- 
tween FOF groups and not between subhaloes. 

One of the targets of this paper is to suggest possible ingredi- 
ents for more detailed ABM studies. We would like to find which 
ingredients might contribute to the clustering of galaxies, inde- 
pendent of the specific model chosen here. This will allow future 
ABM studies to fully explore the parameter space. For this rea- 
son, we use models that span a large range of possible physical 
recipes. The two basic SAM s we use in the work are taken from 
iNeistein & Weinmannl d2010h . which discusses six different spe- 
cific models spanning a large range of galaxy formation scenarios. 

This paper is organized as follows. In section [2] we describe 
the two different methodologies used in this work, namely ABMs 
and SAMs. The shuffling test, which reveals deviations between 
ABMs and SAMs, is discussed in section [3] Section [4] is devoted 
to a more systematic analysis of the various differences between 
the two methodologies. In section [5] we then suggest an extension 
to ABMs that is able to reproduce the results of SAMs. Lastly, we 
summarize the results and discuss them in section[6] 

This paper is based on the cosmological model with 
(fi m , Q.A, h, a s ) = (0.25, 0.75, 0.73, 0.9), whic h is the model 
adopt ed by the Millennium simulation used here dSpringel et al .1 
120051) . Mass units are M@ unless otherwise noted; Log designates 
Logio. 



2 METHODOLOGIES 
2.1 Abundance matching 

Abundance matching models (ABMs) link the stellar mass of a 
galaxy to its host subhalo mass using a simple and empirical 
methodology. The ingredients of the models can be summarized 
as follows: 

(i) Subhalo selection and mass definition: here we choose which 
subhaloes are allowed to host galaxies, and how their mass is de- 
fined. As a result, the abundance of subhaloes and their locations 
are set. 

(ii) Assume a one-to-one relation between the subhalo mass and 
the stellar mass of its galaxy: m sta r — M; n f a ii. More complex mod- 
els allow scatter, or use different relations for satellite and central 
galaxies. 

(iii) Solve for the m star — Minfaii relation, in order to reproduce 
the observed stellar mass function. 

For simplicity, we discuss ABMs which are aimed at producing 
a population of galaxies at z — only. Such models are usually 
tested against the auto-correlation function of galaxies at z — 
(see t]2.2.4| below). For a detailed review of AB Ms and the various 
uncertainties involved see lBehroozi et alj d2010h . In this section we 
further explain the various ingredients of ABMs and specify the 
actual model assumed here. 

We first define the subhalo mass, Mi n f a n, which will be used 
later in order to fix the stellar mass of galaxies: 
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Afinfall = 



Mh, p (Zinia,ll) 



if central within its FOF group 
otherwise 



(1) 



Here Zinfall is the lowest redshift at which the main progenitoi0 
of the subhalo Mh was the central of its FOF group, and Mh, P is 
the main progenitor mass at this redshi ft. We select all the sub- 
haloes from the Millennium simulation dSpringel et al.|[2005l see 
section [2~.2.1t at z — 0. Unlike many ABMs, we also consider sub- 
haloes that are not identified at z = if their galaxies survive within 
the SAM and did not have enough time to merge with the central 
galaxy. Even though these subhaloes are only identified at high red- 
shift, their Mi n f a n can still be computed acc ording to Eq.[T] 

The basic assumption of ABMs (e.g. IConrov et alj|200^) is 
listed in item (ii) above: the stellar mass of a galaxy depends solely 
on Mi n f a ii. Most ABMs assign a unique m s tar value per each sub- 
halo mass. However, several st udies allow a scatter in the mass 
relation, and IWang et al.l {x)0(h use different relations for central 
galaxies versus satellite galaxies (the mass relations are adopted 
from the results of a SAM in this last case). In this work we mainly 
discuss the most detailed ABM approach, where the m star — Mi n f a ri 
relation includes scatter, and might be different for satellite galax- 
ies. We will show that even when using this extended approach, the 
SAM behaviour is different from ABMs. 

Once the assumption on the nature of the m sta r — A/infaii rela- 
tion has been made, the last step in constructing the model is to find 
a relation m star — Minfaii that will reproduce the observed abun- 
dance of m star . If all the subhaloes from item (i) are populated 
with galaxies according to this relation, the resulting set of galaxies 
would have the observed stellar mass function by construction. The 
Minfaii relation can be thus obtained if one assumes that 
it follows some general functional shape, where the free parame- 
ters are constrained by matching the m stal - abundance. A different 
solution can be obtained by solving for the numerical values for 
■Minfaii in each mass bin separately. In this work we do 
not assume any functional shape a-priori, but rather adopt the same 
— Minfaii relation as in the SAMs. 

Although ABMs seem to adopt a rather simplified set of as- 
sumptions, it turns out that the clustering properties of galaxies fit 
the observational data well. This is encouraging, as clustering is 
the main test of the model (stellar mass functi ons are reproduced 
by construction). In addition, it was shown by [Wang et al. I d2006h 
that the results of a specific SAM agrees quite well with this ap- 
proach. All of these tests indicate that ABMs provide a reasonable 
solution to the relation between halo mass and stellar mass. It is not 
clear, however, whether this relation is the only possible solution. 



2.2 The Semi-analytical models 

In this section we briefly describe the SAM formalism being 
used in this work for modeling the evol ution of galaxies. For 
more details the reader is referred to iNeistein & Weinmannl 
d2010h (hereafter NW10). A version of the code used in 
this work is available for public usage through the internet (at 
http : / /www . mpa-garching .mpg . de/ galf orm/sesam). 



1 Main-progenitor histories are derived by following back in time the most 
massive progenitor in each merger event. 



2.2.1 Merger trees 

We use merger trees extracted from the Millennium N- 
body simulation dSpringel et all l2005h . This simulation was 
run using the cosmological parameters (Q m , S1a, h, erg) = 
(0.25, 0.75, 0.73, 0.9), with a particle mass of 8.6 x 10 8 h _1 M© 
and a box size of 500 h~ Mpc. The merger trees used here 
are based on subhal oes identified using the SUBFIND algorithm 
dSpringel et alj200l|x They are define d as the bound density peaks 
inside FOF groups dDavis et al More details on the simu- 

lation and the subhalo merger- trees can be found in lSpringel et al.l 
d2005l) and lCrotonetai1d2006h . The mass of each subhalo (referred 
to as Mh in what follows) is determined according to the number 
of particles it contains. Within each FOF group the most massive 
subhalo is termed the central subhalo of its group. 



2.2.2 Quiescent evolution 

Each galaxy is modeled by a 4-component vector, 



rrt s tar 

miiot 
^rtcject 



(2) 



where m s tar is the mass of stars, m co id is the mass of cold gas, 
rrihot is the mass of hot gas distributed within the host subhalo, and 
rrteject is the mass of gas which is located out of the subhalo and 
is not able to cool directly into the cold phase. We use the term 
'quiescent evolution' to mark all the evolutionary processes of a 
galaxy except those related to mergers. 

It can be shown (see NW10) that most of the quiescent pro- 
cesses included in SAMs can be written in a compact form by us- 
ing linear differential equations. We therefore adopt the following 
model for the quiescent evolution: 



rh = Am + BM h ■ 



(3) 



where Mh is the growth rate of the subhalo mass due to smooth 
accretion (i.e., mass which does not come within other subhaloes), 
and 
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In short, / s , f c , /d, ,/ c , /re are all functions of subhalo mass 
and redshift, and correspond to the efficiencies of star-formation, 
cooling, feedback, ejection, and reincorporation respectively. R is 
the constant recycling factor, which has the values of 0.43 or 0.5 in 
all the models used in this work. 



2.2.3 Satellite galaxies: dynamical friction, stripping, and bursts 

Satellite galaxies are defined as all galaxies inside a FOF group ex- 
cept the main galaxy inside the central (most massive) subhalo. 
Once the subhalo corresponding to a given galaxy cannot be re- 
solved anymore, it is considered as having merged into the most 
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massive subhalo which has the same descendant (the 'target' sub- 
halo). Due to the effect of dynamical friction, the galaxy is then as- 
sumed to spiral towards the center of the target subhalo, and merge 
with the central galaxy in that subhalo after a (potentially signifi- 
cant) delay time. We thus divide galaxies into three different types: 



(0 if central within its FOF group 
1 if central within its subhalo ('satellite') 
2 otherwise ('satellite') 



(5) 



Galaxies of type and 1 can serve as merging targets for type 2 
galaxies. A type 2 galaxy is not following the evolution of its orig- 
inal host subhalo, as its subhalo is not identified anymore. Conse- 
quently, type 2 galaxies do not have a well defined location given 
by the simulation data. We estimate the location of these galaxies 
by using the location of the most bound particle of the last iden- 
tified s ubhalo. Th i s met hod was used before by e.g. ICroton et al .1 
j2006l) ; |GuoetalJ ( l201(]h . 

In our model, the time it takes the galaxy to fall into the central 
galaxy is determined by dynamical friction, and is not directly re- 
lated to the true evolution of the location of the most bound particle 
with respect to its target. At the last time the dark matter subhalo of 
a satellite galaxy is resolved we compute its distance from the tar- 
get subhalo (r sat ), and estimate the dyna mical friction time using 
the formula of lBinnev & Tremaind dl987l) . 



tdf = Qdf 



.17V„rL 



Gm sat In (1 + M h /m s , 



(6) 



Here Mh is the mass of the target subhalo and V v is its virial veloc- 
ity. The value of m sat should correspond to the mass of the satellite 
galaxy which is affected by the dynamical friction process. We use 
two options for m sa t as listed below: 



m s t E 



+ m cold + Mh' 



(a) 

(b) 



(7) 



Here Mh, m in is the minimum subhalo mass of the Millennium sim- 
ulation (1.72 x 10 10 h~ l M & ), and M h is the last subhalo mass 
identified, just before the subhalo has merged into a bigger one. In 
addition to the freedom in choosing m sat , a free parameter, ayf, 
was added to Eq.|6]in order to easily modify the dynamical friction 
time. When a satellite falls into a larger subhalo together with its 
central galaxy we update ta and the target subhalo, for both objects 
according to the new central galaxy. 

While satellite galaxies move within their FOF group, they suf- 
fer from mass loss due to tidal stripping. Only the ejected and hot 
gas of the satellite can be stripped in our model. We assume that 
this stripping has an exponential dependence on time, using the 
same time scale for all galaxies. In order to model this stripping we 
modify A by subtracting a constant ah from two of its elements: 



A sat (3,3) = 
A sot (4,4) = 



—fa — Oth 
—fie — OLh 



(8) 



This constant suggests an exponential decrease in the amount of hot 
and ejected gas. However, the actual dependence of these compo- 
nents on time is more complicated due to contributions from other 
processes, as seen in Eq.|4j 

When satellite galaxies finally merge we assume that a 
SF burst is triggered . We follow Mihos & Hernquistl Jl994l) : 
ISomerville etalTj200ll) : ICox et all GW$\ and model the amount 
of stars produced by 



a l mi 

Am star = Q.b | 

m 2 



(mi, C oid + m 2 ,coid) ■ 



(9) 



Table 1. The different SAM models used in this paper. The models are 
divided into two groups, A & B. Within each group the specific models 
are the same except the details given in this table. Different m sa t mass 
estimates for computing dynamical f riction are given i n Eq.[7] Model A4 
evolves satellite galaxies according to lWeinmann et al] 120 lOh 
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Here rrii are the baryonic masses of the progenitor galaxies (cold 
gas plus stars), m 4iCO id is their cold gas mass, and at, ct c are con- 
stants. 



2.2.4 Correlation functions 

In this paper we study the clustering properties of various mod- 
els using the projected auto-correlation function, w p (r p ), termed 
CF hereafter. Observational data were obtained from the full SDSS 
DR7 rele ase, using 1/Vm ax weighting, i n the same metho d as de- 
scribed in lLi et alTd2006t) and presented in lGuo et"aT I (Haul. Errors 
are estimated from a set of 80 mock SDSS surveys mimicking cos- 
mic variance effects. The CFs are split into five stellar mass bins, 
and are plotted as error-bars in Fig.[TJ(the same data points are used 
throughout this work). 

In order to compute w p (r p ) for the models, we use only two 

, r y , omitting 



coordinates for the location of model galaxies (i.e. 
the r z dependence). We then count the number of galaxy pairs, 
N v , within the same stellar mass bin, and at a given separation r p 



(r p 



+ Ty). The CF is computed from N p using: 



w p {r p ) = 



L N, 
N 2 V v 



?■ - 1 



(10) 



Here V p is the 2-dimensional area covered by the bin r p , N is the 
total number of galaxies in the sample, and L is the size of the simu- 
lation box in h~ 1 Mpc. The resulting w p (r p ) has units of /i _1 Mpc. 
In order to save computational time, we compute CFs only for a 
random subset of 10 per cent of the galaxies within the lower three 
mass bins (we use all galaxies for the two most massive bins). We 
checked that there is no difference between the CF computed using 
the partial sample and the full sample from the Millennium simu- 
lation used here. 



2.2.5 Specific models 

We choose to examine the results of two models which were orig- 
inally presented in NW10. Other models from NW10 have similar 
results and are not adding a significant information to this work. 
We use model II from NW10, in which SF, feedback, and mergers 
are treated in a similar way as in standard SAMs. Three main mod- 
ifications to usual SAMs are adopted in this model: i) There is no 
ejection of gas out of the subhalo (m c j cct = 0); ii) the SF law does 
not include a threshold in cold gas density; iii) Cooling rates (/ c ) 
are tuned to reproduce a large set of observational data. This model 
will be termed model AO in this paper. The second model being 
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Figure 1. The projected auto-correlation functions derived for model ^40. Each panel corresponds to galaxies with stellar masses as indicated by the range 
of LogA/Q. Solid lines show the results of the original model, dashed lines are plotted using shuffling within [Mj n f a n,type], dashed-dotted lines represent 
shuffling withi n [M; n f a n] on ly. See ij3]for more information about the shuffling procedure. The observational data are using SDSS DR7 with the same 
technique as in iLietalJiioOek and are shown as error bars. In the two most massive bins we add labels for the y-axis values of the corresponding line. Poisson 
errors for the model results are smaller than 10 percent for all points, and would not be visible in this plot. 



used here is the one based on lDe Lucia & Blaizoj J2007h (model 
in NW10), and is termed model B0. 

There are a few differences between the results of models AO 
and BO that are important for the following sections. The stellar 
mass functions of these models are very different (see Fig. [10] in 
the Appendix). Model AO fits the observed stellar mass functions 
for z < 3, while model 50 shows significant deviations at low 
and high masses. We will see below that this difference affects the 
relation between the subhalo mass and stellar mass, and the CFs. 
The treatment of satellite galaxies is also different between these 
models. The dynamical friction and stripping time-scales are longer 
in model AO. Consequently, satellite galaxies survive longer, and 
are able to form a significant amount of stars after joining the group. 
Longer dynamical friction time-scales also result in more galaxies 
of type 2 in model AO. 

As we will show later in this work, the results of the CFs are 
sensitive to the treatment of dynamical friction and gas stripping 
in satellite galaxies. In order to further investigate these effects we 
run a few variations of the original models. Models A1,B1 are the 
same as models '0' except that satellite galaxies are not allowed to 
grow in their stellar mass. We artificially shut down all modes of 
SF and do not allow merging into galaxies of type 1 & 2. These 
models mimic extremely fast stripping mechanisms. 

In models A2,B2 we exchange the recipes of dynamical fric- 
tion and stripping between models AO & B0, showing the effect of 



these ingredients on the results. This will help us disentangle be- 
tween the effects of quiescent evolution, and treatment of satellites. 

Lastly, in model A4, we use a prescription for the e volution 
of satellite galaxies as proposed bv lWeinmann et al] | |2010|) . In this 
model gas stripping follows the stripping of the subhalo mass (as 
calculated from the A^-body simulation), where the first quantity to 
be stripped is ra c j cc t. Only when m e j C ct reaches zero does strip- 
ping of the hot gas start. For type 2 galaxies, we use exponential 
stripping of hot gas, using ah defined in Eq. [9] above. Model A4 
was developed in order to match in detail the properties of satel- 
lite galaxies, and is thus the most physically motivated model here, 
in terms of environmental effects within the halo. When tuning the 
dynamical friction time scale in Model A4 we have tried to better 
reproduce the observed CFs (see Fig[9]in the Appendix). 

Table[TJsummarizes the various models used in this work. 



3 THE SHUFFLING TEST 

We want to check whether SAMs include some clustering infor- 
mation that is lost by the assumptions of ABM. A simple test to 
this problem is to adopt the same m s tar — Mi n f a n relation as in the 
S AM, and use i t with in an ABM application. Such a test was done 
bv lWang et all feOOfJ) . Here we choose to use the shuffling proce- 
dure for testing ABMs against SAMs. In general, we will randomly 
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Figure 2. The projected auto-correlation function for the models listed in 
Table[T] Only galaxies with stellar mass 11.27 ^ m a tar ^ 11.77 LogA^Q 
are selected (corresponding to the most massive stellar bin in Fig.[T}. Solid 
lines are the original (un-shuffled) CFs. Dashed lines show the shuffling 
results of each model (shuffling is preformed within galaxies of the same 
Af; n f a n and type). The CFs of different models are multiplied by powers of 
10 for clarity. 



re-distribute the location of SAM galaxies with the same Mi n f a u, In 
this way we mimic an ABM model with the same subhaloes as used 
in the SAM, and with exactly the same distribution of m s tar per a 
given Mi n f a ii. Information about the clustering of galaxies that is 
not related to Mi n f a ii should be eliminated by this procedure, and 
its effect on the CFs can be easily tested without modeling the spe- 
cific ?n sta r — Minfau relation. Moreover, it will be easy to see how 
different modifications of SAMs behave under shuffling, and to add 
constraints on the shuffled groups. 

We first run the SAMs and construct a catalog of galaxies at 
z = 0. For each galaxy we save its location at z = 0, its type (0/1/2, 
see Eq. |5), its host Mj n f a ii, the infall redshift Zj n f a ii, and the host 
FOF group mass at z = 0, M2ocEl We then split the population of 



2 A^200 i s defined as the mass within the radius where the halo has an 
over-density 200 times the critical density of the simulation. 



galaxies into groups of the same Mi n f a ilf|. Within each group of 
galaxies we randomly re-distribute the stellar masses at the group 
locations. In order to further explore the clustering properties of 
SAMs, we will refer in this work to a few versions of shuffling: 

• Shuffling within [Mj n f a ii] 

• Shuffling within [A'ii n faii,type] 

• Shuffling within [M in faii,type,z inf aii] 

• Shuffling within [Minfaii,type,Af 2 oo] 

In each case we split the catalog of galaxies into groups where the 
values of the variables listed in square brackets are the sameQ. The 
shuffling is then implemented in each group separately. When com- 
puting the CF for massive galaxies, we always run 20 different ran- 
dom realizations of shuffling, and plot the average CF values. 

In Fig. Q] we compare the CFs of model AO against two basic 
shuffling tests. When we shuffle between all galaxies of the same 
A/i n faii, the resulting difference in the CF is very large. It reaches 
a factor of ~ 4 in the most massive bin, and a factor of ~ 2.5 in 
the second most massive bin. These large differences indicate that 
treating satellite and central galaxies with the same m star — Mi n f a ri 
relation does not agree with SAMs. We will show below that the 
TT^star — -Minfaii relation for central galaxies is very similar to that 
for satellites in all of the models used here when considering low- 
mass galaxies. This is probably the reason why the CFs at the low 
mass bins do not show a change after shuffling. In this paper we 
do not attempt to reproduce the observed CFs (except for model 
A4 as explained above), but rather compare the results of different 
models. Thus, the observational data presented in Fig. [TJ are only 
meant to provide a basis for comparison between the models. There 
is probably no special meaning to the fact that the shuffled samples 
better agree with the observed CFs. 

Interestingly, the model shows significant change in the CF at 
the higher stellar mass bin, even when performing shuffling within 
[Minfaii, type]. This difference reaches a factor of ~ 2 at small 
scales. More minor differences are apparent in the other mass bins, 
at the level of 15 to 40 per cent. These differences indicate that the 
SAM contains clustering information that depends on something 
other than A/i n faii and galaxy type. 

We have tested the effect of shuffling within [A/i n f a ii,type] on 
all the models listed in Table [TJ and found that variations between 
the different models are relevant mainly for the most massive stel- 
lar mass bin. We therefore show the CFs for this bin, and for all 
the models, in Fig. [2] Some models do not show any difference 
between the original and the shuffled samples, while other mod- 
els exhibit substantial differences. Consequently, model AO is not 
unique in showing differences in CFs due to shuffling. Although 
mod el B0 does not in clude any change in the CFs (in agreement 
with lWang et alj|2006t) . changing the dynamical friction and strip- 
ping time-scales in model B2 introduces a change in the CFs due 
to shuffling. 

In order to find the reason for the behaviour of the shuffled 
catalogs we first did the following tests: 

(i) We shuffled only galaxies with one specific type, leaving all 
other type of galaxies unchanged. The results of this test are that 

3 We allow A/infaii to deviate in 0.1 dex, so each group includes a large 
number of galaxies. We have verified that using bin sizes of 0.05-0.2 do not 
affect the results presented here. 

4 The bins used for Zinfall are °f s i ze 0-1 m L°gz, and bins for M200 are 
of size 0.5 dex. Decreasing the bins size by a factor of a few do not change 
the results shown here. 



Linking haloes to galaxies 7 




Log M infaU 



Figure 3. The abundance of subhaloes with mass A'/; n f a u, obtained by as- 
signing A'/; n f a n for each galaxy from the SAM catalog at z = 0. For type 2 
galaxies, we assign M in f a n according to the subhalo in which these galax- 
ies were last identified as type 1 or 0. Different line types correspond to 
different specific SAMs, as listed in Table [T] The diamond symbols show 
the abundance of Mj n f a n for subhaloes which are identified at z = only. 

only galaxies of type 1 & 2 contribute to the shuffling difference. 
Central (type 0) galaxies do not change their clustering properties 
because of shuffling. We will therefore examine below mechanisms 
which affect the evolution of satellite galaxies. 

(ii) We tested different definitions of the host subhalo mass in- 
stead of Minfaii: the maximum mass in the history of the subhalo, 
and the mass at the first infall identified by the simulation (our usual 
definition of Mi n f a n is at the lowest redshift where the subhalo was 
identified as central, i.e. the last infall). All these definitions give 
rise to the same CF sensitivity under shuffling. 

In the next section we will discuss the various differences between 
SAMs and ABMs in order to better understand the possible reasons 
for the effect of shuffling on the CFs. 



4 DIFFERENCES BETWEEN ABM & SAM 

The results of the shuffling test shown in the previous section indi- 
cate that galaxies produced by ABMs cluster differently than SAM 
galaxies. In order to further study this effect we closely follow each 
of the assumptions made by ABMs and see if it is fulfilled by our 
SAMs. Our motivation is to study second order effects, which are 
currently not being used by ABMs, that contribute to the clustering 
properties of galaxies. 

4.1 Mass function of subhaloes 

ABMs often consider subhaloes which are identified at the in- 
spected redshift only (i.e. at z = 0). Although modern high- 
resolution cos mological simulatio n s seem to resolve substructure 
well enough fconrov et all 120061 : iMoster et all l20ld : iGuo et all 
[2013), there is still som e uncertainty whether this resolution is ad- 
equate jGuo et all20ich . On the other hand, SAMs follow galaxies 
even if their host subhalo has already merged into a bigger one. 
This is done by allowing galaxies to survive an additional time in- 
side their host halo according to dynamical friction estimates (see 



S]2.2.3I >. As a result, the number of galaxies at z = in the SAMs 
might be significantly higher than the number of subhaloes at the 
same redshift, where the additional galaxies are all marked as type 
2. 

In order to quantify this effect we assign Mi n f a u for each 
galaxy in the SAM catalog at z — 0, as was explained in section 
12.11 For type 2 galaxies we use Mi n f a n according to the subhalo 
when the galaxy was last identified as type or 1. The resulting 
mass functions for all the models are plotted in Fig. [3] It is evi- 
dent that all of the SAMs contain a non-negligible number of type 
2 galaxies at z = 0. In models that use a long dynamical friction 
time scale, the difference is bigger, and can be significant even at 
masses that are two orders of magnitude above the minimum mass 
of the simulatiorff]. 

Although the difference in the number of galaxies is some- 
times small in comparison to the total population, these type 2 
galaxies might still affect the CF. In particular, such galaxies are 
usually located close to the central galaxies of their FOF group, and 
thus contribute to the CF at small scales, where a small number of 
galaxies can make a significant difference. The effect can be seen 
by comparing the CF of model ^40 against model A4 (Fig.[2}- These 
models mainly differ in the way dynamical friction is modeled, re- 
sulting in a significant change in the CF at small scales. 

The shuffling test done here does not change the number of 
type 2 galaxies so different mass functions of Mi n f a n do not affect 
this test directly. This can be seen when comparing Fig. [2] and [3] 
there is no obvious correlation between models that are sensitive to 
shuffling and models with high abundance of Mi n f a n. 



4.2 Redshift dependence 

In Fig.|4]we show the m s t ar — Mj n f a n relation for the SAMs used 
here for various redshifts. Clearly, the relation between stellar mass 
and subhalo mass depends on redshift for central galaxies, as wa s 
found by previous ABM studies (e.g.. lConrov & Wechsledl2009h . 
This dependence is mainly related to the evolution of the stel- 
lar mass functions with redshift. At the massive end, the average 
m star — -Mi n f a n relation is also affected by the scatter in m ata r for 
a given value of A/i n f a n. For low mass galaxies, all the models show 
lit tle or no evolution with reds hift, in contrast to the relation derived 
bv lConrov & WechsleJ f200gh . This might be a consequence of the 
different stellar mass functions used by these authors. A different 
important point to note is that the A models show very different 
behaviour than the B models. This difference is an outcome of the 
stellar mass function and its evolution with redshift. The B mod- 
els produce too many small mass and high mass galaxies, so their 
Tiatar — Afi n f a n relations are probably incorrect. 

The dependence on redshift, which is rather obvious for cen- 
tral galaxies (smooth lines in Fig. |4j, is also important for satellite 
galaxies identified atz = 0. In Fig.|4]we plot the average m star for 
satellite galaxies with different Zi n f a ii as symbols. In the A mod- 
els, galaxies with higher infall redshift have higher current stellar 
mass, while the B models show the opposite trend. This effect is 
not included in current ABMs when matching the m sta r — Mi^u 
relation at redshift zero. Shuffling between satellite galaxies of the 
same Mi n f a n will mix galaxies of different Zi n f a ii, an effect that 



5 Various ABMs take this effect into consideration and add subhaloes 
from high redshift according to dynamical friction estimates, see e.g. 
iMosteretailfeOlOl) . 
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Figure 4. The m s tar — Afinfall relation for the various models used in this work and for different redshifts. Each panel shows results from a specific model 
as listed in Table[T] All the values of m s t ar were derived by computing the average per a given M; n f a n mass bin. Lines show m s t ar for type galaxies which 
are selected at the indicated redshift. Symbols show m s t a r for type 1 & 2 galaxies identified at z = 0, split according to their infall redshift Zi n f a ii. For type 1 
& 2 galaxies, we allow Zinfan to deviate in Az = ±0.2 from the indicated z, label (Az = ±0.5 for z = 3). 



might change the CF and contribute to the differences shown in 

Fig.m 

We note that the average m star per a given Zinfaii seen in Fig. [4] 
does not provide enough information in order to quantify the ef- 
fect of shuffling, as it does not include information on the number 
of galaxies per each Zi n f a ii- The population of satellite galaxies is 
dominated by low values of Zinfaii, so only values of Zi n f a n < 1 are 
relevant. This issue will be further examined in section|5] 

4.3 Evolution at early times 

In the previous section we discussed the correlation between Zi n f a ii 
and m star for satellite galaxies, and for a given M; n f a ii. However, 
the average m star for satellite galaxies at z = with a specific 
Zinfaii = Zo is different from the m star of central galaxies at the 
same zo- This is seen by comparing lines and symbols in Fig. [4] As 
a result, one cannot use the m star — Mi n f a n relation that was de- 
rived for central galaxies at high-redshift (e.g.. lConrov & Wechsled 
|2009|) . in order to model the m s t ar — Mi n f a n for satellites at z = 
with various Zinfaii- Here we will explore one reason for this ef- 
fect, namely the difference in m star at the time of infall. In the next 
section we will study the growth in m star after Zinfaii- 

For a distinct FOF halo, it is well established that the large- 
scale over-density of its environment is correlated with the halo for- 
mation history. According to this 'environmental effect', haloes of 
a given mass living in denser environments are formed earlier (this 



is also termed the 'assemb ly bias' : iGao et all 120051 : lHarker et al .1 
l200d IWechsler et alj|2006h . The effect introduces a weak corre- 
lation between the stellar mass of galaxies in the SA Ms and the 
environment of their host FOF halo dCroton et alj|2007h . As galax- 
ies identified just before Zinfaii live in denser environments than 
the average central galaxy, this effect might generate a difference 
in m s t ar between all central galaxies and future satellite galaxies. 
Moreover, for satellite galaxies that fall into a larger halo at high-z, 
the environment is correlated to the mass of the target FOF group at 
z = 0. Thus, we expect that m star will be correlated with the group 
mass M200 and not only with Mi n f a n. 

We first test this effect for the underlying dark-matter evolu- 
tion, using subhaloes from the Millennium simulation. In Fig. [5] 
we show the main-progenitor histories of subhaloes with the same 
A^infaii and Zinfaii- We split the average histories into different sub- 
sets of equal M200 (the host of these subhaloes at z = 0). Also plot- 
ted in the same figure are the average histories of central subhaloes 
identified at the same Zinfaii - Indeed, merger-histories at early times 
are correlated to the M200 mass at z = 0, especially if the infall 
redshift is low. This effect is very similar to the environmental ef- 
fect mentioned above, where subhaloes that fall into more massive 
M200 form earlier (i.e., the redshift which corresponds to half the 
current mass is higher). 

In order to test the effect of M200 on early formation histo- 
ries of galaxies, we stop all modes of SF and merging for satellite 
galaxies in models Al and SI. In all other aspects these models 
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Figure 5. Main-progenitor histories of subhaloes with the same infall mass and redshift, split by their host mass at z = 0, A/200 - Each panel includes histories 
for a given values of Zj n f a ii and iWinfaU- We track the main-progenitor histories starting z = 0, so only subhaloes which survive until z = are included. 
Diamond symbols show the averaged main-progenitor histories for all central subhaloes which have the same mass at the infall redshift. Each M200 bin 
includes values within 0.2 dex in LogM©, where the number of main-progenitor histories used in each bin is given in parentheses. All values are taken from 
the full Millennium simulation. Time is in Gyr since the big-bang. 



are the same as models AO and BO. Consequently, in models Al 
and Bl, the stellar mass of satellite galaxies at z — is identical to 
their mass at ZiuiaXl- The reader is referred again to Fig. [4] where the 
Afinfaii relation is plotted for models Al and Bl. Not sur- 
prisingly, the stellar mass of satellite galaxies is already different at 
Zinfaii from the stellar mass of central galaxies. This difference can 
reach 70 per cent in model Al for the average value of m s t ar , at 
M infaU ~ 3 x 1O 12 M . 

The effect of M200 on m a tar at early epochs can modify the 
CFs, and might contribute to the effect of shuffling discussed in 
sectionf3] The reason that there is no change in the CF due to shuf- 
fling in models Al and Bl might be that the effect of environment 
is maximal in these models, which seems to cancel the contribution 
of other effects going in the opposite direction. We will discuss this 
issue further in section|5] 



4.4 Stellar mass growth after infall 

The amount of stellar mass gained by galaxies of type 1 & 2 after 
Zinfaii can be seen in Fig. [4] when comparing the various models 
to models Al,Bl. Models with slow gas stripping within satellite 
galaxies (AO, Al & B2) show a significant increase of mass after 
infall, while models with fast stripping show only a minor change 
(BO & A2). In models with slow stripping mechanism, more hot 
gas is available for cooling, and consequently more cold gas can 
reach the disk and form stars. Both observational and theoretical 
studies indicate that slow stripping is the preferred scenario for 
mode ling the environmenta l evolution of galaxies within their host 
halo ^McCarthy et alJl2008l:lFont et al.ll2008l ; lKhochfar & OstrikeJ 
120081 - IWeinmann et al j201Ch . 



10' 



In model AA we use stripping of hot gas which is propor- 
tional to the dark-mat ter stripping. This scenario was found by 
IWeinmamTe t al. ( 2010) to reproduce the fraction of passive satellite 
galaxies as a function of group mass and distance from the group 
center. In this case, the stripping rate will be correlated to the dark- 
matter evolution. From Fig. [5] it seems that the dark-matter evolu- 
tion of subhaloes after Zinfaii is affected by Af20o, the group mass at 
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Figure 6. The CFs for various models using a range in m s t ar as indicated 
in units of LogMQ. For each model we run a few different shuffling tests, 
where the groups of shuffled galaxies are constrained to have the same 
set of properties: iWinfai] and galaxy type (dashed lines); Mi n f a n, galaxy 
type, and Zj n f a ii (dashed-dotted lines); Af; n f a n, galaxy type, and M200 (di- 
amonds). An ABM model using two parameters for fixing the stellar mass 
of galaxies, namely M; n f a n and M200, is shown to reproduce the CFs of 
the SAM galaxies. 



z — 0. This is probably a natural consequence of the tidal interac- 
tion within the group. As a result of the above, the amount of m s t a r 
gained after infall should correlate with M200 in this model. On the 
other hand, it may be expected that galaxies with higher Zi n faii will 
have more time to increase their m a tar, so correlation with Zi n f a ii is 
also expected. 
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Figure 7. The behaviour of M200 and Zinfall as a function of Mi n f a n and m s tar for satellite galaxies (type 1 & 2). Upper panels: median values of Log 
-M200 /Af; n f a n; Lower panels: median values Zinf a ii- Color coding is set to span 95 per cent of the population of Zi n f a u and M200 /Onfall- Each panel shows 
results for one model as indicated. For the list of all models please see Table[T] Median values are computed for bins with more than 10 galaxies. Diamonds 
show the median m s tar for all galaxies in the same Mj n f a n bin. 



5 ABM WITH AN ADDITIONAL PARAMETER 

In section[3]we saw that SAMs include additional clustering infor- 
mation which is not reproduced by a dependence on Mi n f a n only. 
This was demonstrated by the different CF results when perform- 
ing shuffling. We then learned that a single m star — Af; n f a n rela- 
tion cannot reproduce accurately the behaviour of satellite galax- 
ies. It is not easy to disentangle the different reasons for the devi- 
ation in the m star — Mi n f a ii relation, and their affect on the CFs. 
This will require studying SAMs which include only one devia- 
tion per a model. For example, we would need to develop a SAM 
in which there is no redshift evolution in the mass relation, but 
satellite galaxies do increase their stellar mass function after infall. 
Here we choose to present a more modest test. Instead of modify- 
ing the SAMs, we will present a modification of ABMs which is 
able mimic the behaviour of SAMs better. 

The target of this section is to present a more detailed version 
of ABMs which is relatively simple to use, and is able to fit the CF 
of SAMs. This is achieved by adding an additional parameter to 
ABMs, so the stellar mass of a galaxy is fixed not only by Mi n f a n 
and the galaxy type, but also according to an additional ingredient. 
According to the previous section, there are two parameters which 
naturally affect the stellar mass of satellite galaxies: the infall red- 
shift (Zinfali)i and the group mass at redshift zero These 
parameters are correlated because of the hierarchical build-up of 
haloes (galaxies inside more massive groups tend to have a higher 
Zinfaii)- Consequently, it is hard to separate the effect of each pa- 
rameter on the models. 



An important criterion for choosing the second parameter is 
the behaviour of the shuffled CFs. In order to check this issue, 
we present two additional shuffling tests. For each of the param- 
eters tested (p = Zinfaii, M200) we divide the catalog of galaxies 
into groups of the same Minfali, type, and p values as discussed in 
section [3] The location of galaxies are shuffled within these new 
groups. The resulting CFs are shown in Fig. [6] For all the models 
tested here, using the group mass, M200, results in no change in the 
CF after shuffling. This means that SAMs have no additional clus- 
tering information except the dependence on Minfan, A/200, and 
galaxy typ^f). Consequently, ABMs that would use M200 to fix the 
stellar mass in addition to Minfall would be able to reproduce the 
results of SAMs. Such models might have more freedom and could 
span more possible — Mnfaii relations. 

Naively, one may expect that the main variation in the m sta r — 
Mintaii relation should be due to the the additional dependence of 
satellite mass on Zinfaii- However, this parameter does not allow 
ABMs to better follow the CF of SAMs. In Fig. [6] adding zinfaii 
as a second parameter makes the CF of model B2 to deviate even 
more from the original CF. Adding Zinfaii to model AO improves the 
CF, but it does not reach the accuracy achieved by using M200 ■ The 
failure of Zinfaii to improve ABMs is surprising, as it naturally af- 
fects the stellar mass of satellite galaxies. However, it seems that its 



6 According to lCroton et alj b007t) there are deviations on the order of 10 
per cent because of environmental effects on scales larger than the halo. 
These variations are too small to be seen in Fig. [6] 
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correlation with stellar mass is weaker than the correlation between 
M200 and stellar mass. 

We have examined various statistical aspects of the effect of 
M 20 o and Zi n f a n on m star (per a given M infaU ). The main obsta- 
cle in presenting a clear-cut evidence for why M200 works better 
than zjnfaii is the different abundance of satellite galaxies as a func- 
tion of Zi n f a n and M 200. For example , most satellite galaxies have 
Zmfaii < 1 (see e.g. iGao et af]|2004l) . but have a different unique 
range in M200 /Minfaii. A clear comparison should take this num- 
ber distribution into account. We therefore plot in Fig.|7]the median 
values of M200 and Zinfaii per each bin 111 7Tl s tar 

and Mi n f a ii. When 

comparing M200 against Zi n f a ii, the number of galaxies per each bin 
remains the same. However, we note that most of the galaxies are 
concentrated around the median m s t a r values (diamond symbols). 

Fig. [7] shows the reason for the success of M200 as a second 
parameter. The dependence of m star on K'hoa (for a given Mj n f a n) 
is roughly monotonic and smooth. For a given Minfaii, the values 
of M200 span nicely the scatter in m s t ar . A different behaviour is 
seen for Zinfaii- The relation between m 8 tar and zinfaii for a given 
A/i n f a ii is not monotonic. This means that the dependence of m s tar 
on Zi n f a u is in general not unique, so very different m star corre- 
spond to the same Zinfaii- For example, consider model B2 (lower- 
right panel): shuffling galaxies with the same A:/i n f a ii ~ 10 13 and 
Zinfaii ~ 1 will result in swapping galaxies between very different 
stellar masses (the dark regions). As can be seen from the upper 
panel, the swapped galaxies live inside different FOF groups, so 
this shuffling will affect the CF significantly. This is probably the 
reason why Zi n f a u does not improve the shuffled CF in model B2. 
We note that the range in m star used for testing the shuffled CFs in 
Fig. H] corresponds to Minfaii > 1O 12 M0. As the number of sub- 
haloes of a given Mi n f a ii decreases quickly with increasing Minfaii; 
the CF in this mass range is mostly affected by Minfaii ~ 1O 12 M0. 



6 SUMMARY AND DISCUSSION 

In this work we have studied the relation between the mass of sub- 
haloes (Minfaii) and the stellar mass of galaxies. A simple model 
for this relation is crucial for summarizing the basic concepts of 
galaxy formation physics. It can then be used to model the correla- 
tion function of galaxies, their stellar mass function, star-formation 
rates, and merging processes. The abundance matching approach 
(ABM) is aiming at constraining this mass relation directly, with- 
out the need to rely on the assumptions of a specific model. This 
constraint is important because other theoretical methodologies de- 
pend strongly on the detailed physics assumed, and are usually not 
able to match the observational constraints accurately. It is there- 
fore necessary to study the assumptions made by the ABM ap- 
proach, and their effect on the mass relation above. 

In this paper we have tested the basic assumptions of ABM 
mo dels using a set of Sem i -Anal ytical Models (SAMs) developed 
by iNeistein & Weinma nn (2010). We examined the relation be- 
tween stellar mass and Mi n f a ii within the SAMs at different red- 
shifts, and for satellite versus central galaxies. It was shown that 
a single m s tar — Mi n f a il relation is not able to capture accurately 
the full complexity of SAMs, due to complex behaviour of satellite 
galaxies. First, the stellar mass of these galaxies behaves differently 
at early times due to an envi ronmental effect acting on their hos t 
haloes (the 'assembly bias'; IGao et alj|2005l : lHarker et alj|2006h . 
Satellite galaxies falling into more massive groups have different 
dark-matter merger-histories, and are more massive even before 
falling into their group. Second, the mass of satellite galaxies de- 



pends on the infall redshift (zin&ii, defined as the time they became 
satellites). This is because they roughly follow the global behaviour 
of central galaxies. Third, satellite galaxies acquire a significant 
amount of stellar mass after Zinfaii. especially if the model assumes 
slow gas stripping mechanisms. This growth can reach a factor of 
~ 2 in stellar mass, for galaxies with zinfaii — 1- 

We have used the shuffling technique, where the location of 
all model galaxies that have the same Mi n f a ii and galaxy type (cen- 
tral/satellite) are swapped randomly. Assuming that the stellar mass 
of galaxies depends only on the host subhalo mass (Minfaii), is 
equivalent to using a shuffled sample of galaxies. In order to check 
the effect of this assumption on the clustering properties of galax- 
ies, we compared the auto correlation functions (CFs) of the orig- 
inal SAMs against the shuffled catalogs. We found that shuffled 
catalogs of galaxies can have different CFs, depending on the spe- 
cific model being tested. The shuffled CFs are very different if we 
assume the same m star — Minfaii relation for satellite and central 
galaxies, reaching a factor of ~ 4 at scales below 1 Mpc. However, 
even when using a different m 8 tar — Minfaii relation for central and 
satellite galaxies, this difference can reach a factor of 2 in the CF 
of the most massive galaxies. 

The results shown here are based on a few specific SAMs and 
might not reflect the 'true' physical Universe. It might be that the 
assumptions adopted by ABM models are correct, and SAMs in- 
troduce more complexity than what is needed. However, it might 
be that the 'true' model includes a different set of assumptions, or 
is more extreme in violating the assumptions tested here. A differ- 
ent limitation of this work is that the results of the auto-correlation 
functions obtained here are based on the Millennium simulation. 
This simulation assumes erg = 0.9, which is higher than the lat- 
est favorable estimates. As a result, the relation between the model 
CFs and the observed data is of less interest to this study. We did 
not aim at providing a model which reproduce the observed values 
of both the stellar mass function and the CFs. 

We have checked two additional parameters which could be 
adopted by future ABM models in order to better reproduce the 
results of SAMs. For the SAMs used here, assuming that the stel- 
lar mass depends on Minfaii, M200 and galaxy type is enough to 
reproduce the CF of galaxies (M200 is the FOF group mass at red- 
shift zero). The other natural parameter, Zinfaii does not provide a 
good match to the clustering of SAMs probably because it is less 
correlated with the stellar mass. Using M200 as a second param- 
eter makes ABM models more similar to HODs, in which all the 
properties of galaxies are fixed by the group mass. 

Our results may be relevant for interpreting the observed en- 
vironmental dependencies of galaxies. While various results stress 
the imp ortance of late evolution on the properties of satellite galax- 
ies (e.g. lKauffmann etal .120041) . we find evidence for the influence 
of early evolution. At a fixed Mi n f a ii, galaxies ending up in massive 
clusters undergo a more rapid growth of their dark matter halo at 
early times, which may influence their observed properties at late 

times, like their morphology or star fo r matio n rate. 

Studies like Ivan den Bosch et al.l J2OO8I) and lWeinmann et all 

d2009h investigate the impact of environment b y comparing satellite 
and ce ntral galaxies at fixed stellar mass todav. lvan den Bosch et al.l 
j2008l) argue that this is reasonable, as most satellites fall in late, 
and thus no large difference in the evolution of stellar mass between 
central and satellite galaxies since this time is expected. However, 
as we have shown here, the progenitors of today's satellites and the 
progenitors of today's centrals may be different already at redshifts 
before the time of infall. Depending on the exact nature and magni- 
tude of this effect, it may complicate the direct comparison between 
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central and satellite galaxies. In a future work we intend to study 
the differences in stellar mass between satellite and central galax- 
ies in more detail, making use of the two parameter ABM approach 
suggested in this work. 
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APPENDIX 

In this Appendix we provide more CF plots for models B0 and A4 
(Figs.[8][9]l. We also plot the stellar mass function for all the models 
used in this work against the observational data in Fig.UOl 
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Figure 8. The projected auto-correlation functions derived for model B0. Each panel corresponds to galaxies with stellar masses as indicated, in units of 
1.ih: A/ . . Solid lines show the results of the original mo del, dashed lines are plotted using shuffling within [M; n f a u,type]. The observational data are using 
SDSS DR7 with the same technique as in iLi etaljf2006l) . and are shown as error bars. 




Figure 9. Similar to Fig. [8] but for model AA. 
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Figure 10. Stellar mass functions of galaxies at various redshifts, for our different models as indicated in the top left panel and as summarized in table [T] 
The observational data is plotted with gray symbols. At all redshifts hig her than zero we co nvolve t he model stellar mas ses with a Gaussian error dist ribution, 
with standard deviation of 0.25 dex. At z = we use observations bv lLi & W hite (2009, circles), iBaldrv et alj fe008t crosses), IPanter et"al] 12007), pluses). 
Note t hat other observational st udies predict a sligh tly different function (e.g.]Ben^i^e^ilj2O10|x At hig h-z we use the following observations: iBundv et alj 
l200d z = 0.75-1, circ les), iBorch et al J l200d z = 0.8 - 1, crosses) IPerez-Gonzalez et all J2008t z = 0.8 - 1, z = 1.6 - 2, z = 2.5 - 3, plus 
signsl lFontana et alj JioOfj z = 0.8 - 1, z = 1.6 - 2, z = 2 - 3, stars ! iDrorv et all J2004I. z = 0.8 - 1, upward-pointing triangles! Iprorv et all f2005L 
z = 0.75 - 1.25, z = 1.75 - 2.25, z = 2.25 - 3, diamonds and squares). lMarchesini et aljj2009l z = 1.3 - 2, z = 2 - 3, right-pointing triangles). Model 
stellar mass functions are plotted at z = 0, 1, 1.5, 2.5 according to the label on each panel. We treat the specific IMF chosen in each measurement as part of 
the observational 'uncertainty' and do not convert them into the same IMF. 



